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Abstract 



Recent experiments have identified peptides with adhesion affinity for GaAs and Si 
surfaces. Here we use all-atom Monte Carlo (MC) simulations with implicit solvent to 
investigate the behavior in aqueous solution of four such peptides, all with 12 residues. 
At room temperature, we find that all the four peptides are largely unstructured, 
which is consistent with experimental data. At the same time, we find that one of 
the peptides is structurally different and more fiexible, compared to the others. This 
finding points at structural differences as a possible explanation for differences in 
adhesion properties between these peptides. By also analyzing designed mutants of 
two of the peptides, an experimental test of this hypothesis is proposed. 

Keywords: peptide folding, peptide adsorption to solid surfaces, all-atom model, 
Monte Carlo simulation 
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Introduction 



The advancing progress in manipulating proteins and non-biological macro- 
molecules and materials at the nanometer scale opens up possibilities for construct- 
ing novel hybrid materials with potential applications in bionanotechnology. ^'^ An 
important development in this direction is the identification of proteins that can 
bind to specific compounds. Over the last decade, genetic engineering techniques 
have been successfully employed to find peptides with affinity for, e.g., metals, ^'^ 
semiconductors^ and carbon nanotubes.^ However, the mechanisms by which pep- 
tides bind to these materials are not completely understood; e.g., it is unclear what 
role conformational changes play in the binding process. 

Here we report atomic- level simulations of the solution behavior of four 12- residue 
peptides, whose adhesion properties to (100) surfaces of GaAs and Si crystals were 
studied in recent experiments.^' ^'^ The main quantity measured in the experiments 
was the peptide adhesion coefficient (PAC), defined as the percentage of surface 
coverage, after drying and washing of the samples which were originally in contact 
with the peptide solution. This quantity was measured by AFM for the different 
peptide-substrate combinations,^'^ and was found to show a clear dependence on 
both peptide and substrate (see below). 

How the binding occurs in these peptide-surface systems is unclear. However, 
although the bound peptides were found to form clusters,^ it seems unlikely that 
the peptides aggregate before binding to the surface, because the hydrophobicity of 
the peptides studied is low and the peptide concentration was low, in the nanomolar 
range. A more accurate description is probably that the peptides bind one by one, 
a process that, in principle, can occur in two fundamentally different ways. One 
possibility is a docking behavior, where the peptides bind to the surface without un- 
dergoing any major conformational change. This scenario assumes that the peptides 
have a stable structure in solution, and that this structure matches the structure of 
the surface, e.g., with respect to polarization. The second variant is that the peptide 
is unstructured before binding occurs. Although the bound peptide structure need 
not be unique, the process would then have similarities with coupled folding-binding,^ 
which can be an efficient mode of binding compared to docking. Measurements of 
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circular dichroism (CD) spectra suggest that all the four studied peptides are largely 
unstructured in solution,^ thus favoring the second type of binding over docking. 

Recent studies have found that the adhesion propensity of peptides to various 
surfaces can be in part explained in terms of adhesion properties of their constituent 
amino acids/^'^^ However, the amino acid composition alone cannot explain the PAC 
values obtained experimentally for the four peptides studied here. In fact, two of these 
peptides share exactly the same amino acid composition, but still have quite different 
adhesion properties. In order to explain the adhesion properties, it might thus be 
necessary to take structural characteristics into account. However, as already indi- 
cated, the CD measurements did not reveal any clear structural differences between 
these peptides.^ 

The aim of our study is to get a more detailed picture of the behavior in aqueous 
solution of these peptides, and look for possible structural differences not seen in 
the CD analysis. A perfect model for folding simulations does not exist. It is worth 
noting, however, that the model we use,^"^'^^ despite a simplified energy function, is 
capable of folding both a-helical and /9-sheet peptides, without changing any model 
parameters. 

Three of the peptides we study have previously been simulated^^ using the 
ECEPP/3 force field. This study found only minor differences in folding behavior 
between these peptides. To further elucidate the structural properties of these pep- 
tides at room temperature, we here perform simulations using an alternative model, 
which has given realistic results for the stability and its temperature dependence for 
the peptides that it was able to fold.^^ 

Simulating the actual binding of the peptides to the surface is more challenging 
due to uncertainties about the precise form of the peptide-surface interactions and 
their dependence on solvation effects. ^^'^^ Nevertheless, such simulations have been 
performed for gold-binding peptides. ^° The phase structure for chain adsorption to 
attractive surfaces has been investigated using lattice models for polymers^^"^^ and 
peptides. Simphfied statistical-mechanical models have also been used to study 
molecular recognition of patterned surf aces, and conformational changes of pro- 
teins adsorbed to a solid surface.^^'^^ 
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Model and Methods 



Peptides Studied The four peptides we study are listed in Table 1, where also PAC 
values for (100) GaAs and Si surfaces can be found. The sequence SI was selected 
from a huge library of 12-mers for adhesion to GaAs.^ Its very poor propensity to 
adhere to Si is noteworthy. The sequence S2 is obtained from SI by exchanging two 
histidines for alanines. This double mutation leads to a reduced PAC for GaAs and 
a slightly increased PAC for Si. The peptide adhering best to the Si surface is S3, 
which is a random permutation of SI. The Si PAC is a factor 15 higher for S3 than 
for SI, despite that their amino acid composition is the same. The last sequence, S4, 
is derived from SI by replacing three asparagines by alanines. This change results in 
a shghtly reduced GaAs PAC and an increased Si PAC. 

Peptide Model The model we use contains all atoms of the peptide chain, includ- 
ing H atoms, but no explicit water molecules. It assumes fixed bond angles, bond 
lengths and peptide torsion angles (180°), so that each amino acid has the Ramachan- 
dran angles 0, and a number of side-chain torsion angles as its degrees of freedom. 
Here a brief presentation of the energy function will be given. Detailed descriptions 
of the parameterization of the geometry^^ and the different energy terms^^ can be 
found elsewhere. 

The energy function consists of four terms, 

E — Eev + E\oc + -Bhb + -E'hp ■ (1) 

The first term, E^y, represents excluded volume effects and is of the form 

12 

(2) 

where the sum is over all atom pairs. The parameters are atomic radii and \ij 
is a scale factor, which is 1.0 for pairs connected by three covalent bonds and 0.75 
otherwise. 

The second term represents an interaction between neighboring NH and CO par- 



Eev — f^ev ^ ^ 
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tial charges along the backbone. It is given by 



i=N,Hei 



(3) 



j=c,oei 

where the outer sum is over all amino acids and the are partial charges. 

The H bond contribution, i^hb, consists of two parts: backbone-backbone bonds 
and backbone-sidechain bonds, 



^hb = 4b Yl HnjMaij, Pij) + ejfj u{rij)v{aij, Pij) , 



(2) 



(4) 



bb-bb 



bb-sc 



where Vij denotes the HO distance, the NHO angle and Pij the HOC angle. The 
function u{r) is given by 



N 12 / \ 10 



(5) 



and the angular dependence is 



v{a,P) 



(cosacos/3)^/2 iia,p>90°, 
otherwise. 



(6) 



The last energy term, E^p, represents an effective hydrophobic attraction and has 
the form 

Ei,p^-J2MijCij, (7) 

KJ 

where the sum is over all pairs of nonpolar amino acids. The M/j (> 0) are constants 
that determine the strength of attraction between amino acids / and J. C/j is a 
geometric factor and a measure of the degree of contact between two side chains. It 
is defined as 



C 



1 



ij 



Ni + Nj 



V /(rninr^.) + /(minr^ ) 



jeAj 



(8) 



where Aj denotes a predefined set of Nj sidechain atoms for residue /. The function 
f{x) is given by f{x) = 1 if x < A, f{x) = if x > S, and f{x) = {B - x)/{B - A) 
iiA<x<B[A = (3.5 A)^ and B = (4.5 A)^]. 
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Simulation Method To investigate the solution behavior of the peptides S1-S4, 
we perform simulated-tempering^^' simulations with eight temperatures in the range 
275-369 K, and some reference runs at a constant temperature of 1 000 K. The con- 
formational updates we use are rotations of single backbone and sidechain torsion 
angles, and a semi-local backbone update, biased Gaussian steps (BGS),^^ which up- 
dates seven or eight consecutive angles in a manner that keeps the rest of the molecule 
approximately fixed. In the simulated-tempering runs these updates are called in dif- 
ferent proportions at different temperatures with more BGS at lower temperatures. 
At 299 K, the fractions of attempted single-angle backbone moves, sidechain moves 
and BGS are 0.29, 0.51 and 0.20, respectively. In the lOOOK simulations the corre- 
sponding fractions are 0.245, 0.51 and 0.245. 

Our simulations are carried out using the software package PROFASl,"^^ which is a 
C++ implementation of the above model. Each simulation comprises 10^ elementary 
MC steps. 

The results of our simulations are analyzed using multi-histogram techniques. 
All statistical uncertainties quoted are la errors obtained by the jackknife method. 

Results and Discussion 

Overall Structure and Temperature Dependence Cooperative structural ac- 
tivity is typically signaled by a peak in the statistical fluctuations of system rele- 
vant quantities, such as the energy. Figure 1 shows how the specific heat, Cy = 
d{E)/dT = ((-E^) - {E)^)/kBT^, and the temperature derivative of the radius of 
gyration, d{Rg)/dT, vary with temperature for the sequences S1-S4 ((•) denotes a 
Boltzmann average). The quahtative behavior of the three sequences SI, S2 and 
S4 is virtually identical. For all three sequences, the specific heat exhibits a broad 
peak with maximum around 280 K. The d{Rg) / dT curves show a similar broad peak, 
although the statistical errors are larger and the maximum is slightly shifted toward 
higher temperature. 

In the temperature regime where these peaks occur, it turns out that the 
secondary-structure content of these three sequences changes relatively rapidly. As 
the temperature decreases, the a-helix content, (na), increases, whereas the /3-strand 
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content, (n^j), decreases slightly, as can be seen from Fig. 2. These results indicate 
that the structures with lowest energy are a-helical for SI, S2 and S4. It should be 
noted, however, that the a-helix content remains small, < 0.25, all the way down to 
273 K. 

The sequence S3 shows a markedly different behavior. Neither Cy nor d{R^ / dT 
has a maximum within the temperature range studied; both quantities increase mono- 
tonically with decreasing temperature (see Fig. 1). Furthermore, the /3-strand con- 
tent remains larger than the a-helix content at low temperature for this sequence 
(see Fig. 2). The /9-strand content does not decrease with decreasing temperature, 
and the a-helix content increases much less than for the other sequences. 

Figure 3 shows typical low-energy conformations for the four different sequences, 
as obtained by simulated annealing.^^ As one might expect from the temperature 
dependence of the a-helix and /3-strand contents, the structure is a-helical for SI, S2 
and S4. However, the a-helix does not span the entire chain, but rather the region 
between residues 3 to 12. That the beginning of the sequence does not make a-helix 
structure is not unexpected, because there is a proline at position 4. The lowest- 
energy structure we find for S3 is a /5-hairpin. Its turn is at residues 6 and 7. The 
second strand of the /9-hairpin, spanning residues 8-12, is not perfect but broken in 
the vicinity of the proline at position 9. 

It must be stressed that the states illustrated in Fig. 3 are only weakly populated 
at room temperature, as is evident from the secondary-structure contents shown in 
Fig. 2. Our results are thus consistent with the CD analysis of the solution behavior 
of these peptides,^ at room temperature and pH 7.6, which suggests that they all are 
largely unstructured. 

Our conclusion that the ct-helix content, at low temperature, is higher than the 
/3-strand content for SI and S2, is in agreement with a previous study of S1-S3 based 
on the ECEPP/3 force field. However, in that study, the sequence S3 was found to 
be a-helical as well. Furthermore, the a-helix content of SI and S2 was significantly 
higher compared to what we find and to what is indicated by the CD results.^ 

Having studied the overall structure and the temperature dependence, we now 
turn to a more detailed structural description at T = 299 K, which is close to where 
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the CD measurements were taken. ^ This discussion will mainly focus on SI and S3, 
as the double mutant S2 and the triple mutant S4 show a behavior very similar to 
that of SI. 

Structural Characterization at T = 299 K To further elucidate the structure 
and free-energy landscape of these peptides, we analyze root-mean-square deviations 
(RMSD) from suitable reference structures (calculated over backbone atoms). We 
first consider an a-hehcal reference structure. The N-terminal part of SI is rather 
flexible due to a proline at position 4. Similarly, the C-terminal part of S3 is flexible, 
due to a proline at position 9. To reduce noise, we omit these tails when calculating 
RMSD. The reference structure used is an a-helix with 8 residues. With RMSD 
calculated this way, wc study the free energy F{A, E) as a function of RMSD, A, 
and energy, at 299 K. Figures 4a and 4b show contour plots of F{A,E) for SI 
and S3. For both sequences, the free-energy minimum is at an RMSD of about 3.4 A, 
which is approximately the average value for random structures, as obtained from 
control runs at lOOOK. This finding supports the conclusion that SI and S3 both 
are largely unstructured at 299 K. A clear local free-energy minimum corresponding 
to a-helix structure is missing for both sequences. For SI, there is, however, a valley 
from the global minimum in the direction of low RMSD and low energy, and there is a 
small but significant fraction of a-helical conformations with A ~ 1 A and relatively 
low energy. For S3, there is a valley in the same direction, but it is less pronounced, 
and conformations with a A as small as 1 A are rare. There is also a second valley 
for S3, where the lowest populated energies are found. The appearance of this second 
valley, where A > 3A, is not unexpected, given that the lowest-energy structure 
found for S3 is a /3-hairpin (see Fig. 3c). Figure 4c shows F(A, E) for S3 when this 
/3-hairpin is taken as the reference structure. A local minimum with A ~ lA and 
low energy can be found, but it is very weakly populated. The dominating global 
minimum corresponds to unstructured conformations. In fact, the average RMSD 
from the /3-hairpin for random S3 conformations, as obtained from a control run at 
1 000 K, is about 6 A, which is approximately where the global minimum is found at 
r = 299K. 

Next we examine how the a-helix and /3-strand contents (as defined in the caption 
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of Fig. 2) vary along the chains. Let Xa{i) = 1 if residue i is in the a-hehx state 
and Xa{i) — otherwise, so that (Xa(0) probabihty of finding residue i in 

the a-hehx state, and let X/3(i) denote the corresponding function for the /3-strand 
state. Figure 5 shows (Xa(i)) and (x^(i)) against i for S1-S4 at T = 299 K. The low- 
energy conformations of SI, S2 and S4 shown in Fig. 3 contain an a-hehx starting 
near position 3 and ending at the C terminus. The a-helix probability profile in 
Fig. 5a reveals that the stability of this ct-helix is not uniform along the chain; its 
N-terminal part is most stable, whereas the stability decreases significantly toward 
the C terminus. For S3, it can be seen from Fig. 5b that the {xpif)) values are similar 
in the two regions that make the strands of the /9-hairpin in Fig. 3c. An exception 
is Pro9, for which (x/3)(0 strictly zero (proline has a fixed = —65° in the model, 
which falls outside the interval in our /3-strand definition). We also note that the 
two end residues tend to be unstructured for all four sequences, with relatively small 
values of both (Xq(0) and {Xdii))- 

From the single-residue probabilities (Xa(0) (X/3(0)' '^'^^ cannot tell whether 
or not the formation of secondary structure is cooperative. To study that for SI, S2 
and S4, we calculate the helix-helix correlation coefficient for neighboring residues at 
T = 299 K, as defined by 



= ; . (9) 

where 



Ci^^-{Xa{i)Xa{j))-{Xa{i)){Xa{j)). (10) 

For all three peptides, we find that the largest r,|"|^ values occur in the region from 
i = 4 to i = 9 and are in the range 0.3-0.5. These values indicate that helix 
formation is a rather weakly cooperative process for these peptides. Consequently, 
the free-energy barrier to helix formation should be low, a conclusion that is in line 
with the results shown in Fig. 4a. For S3, r^-"!^ is about 0.3 or smaller for all i. 
The analogous strand-strand correlation coefficient r^-^li, defined in terms of Xpi'^)^ 
is smaller than 0.25 for all i for all the four sequences. 

Another way of analyzing secondary-structure correlations is to look at the typical 
lengths of unbroken a-helix and /3-strand segments. Specifically, we calculate the 



10 



fraction of conformations, at fixed T, that have at least one unbroken a-hehx 
strand) stretch with 3 residues or more, which we denote by (A^). Table 2 shows 
Xa and for SI and S3 at three different temperatures. For SI at T = 299 K, we find 
that Aq, = 0.12. This result can be compared with what one would expect if the Xa{i) 
were independent random variables with i-dependent individual distributions, given 
by Fig. 5a. In this uncorrelated case, it turns out that one would find Aq = 0.04. This 
comparison shows that the correlations are significant but not very strong. For S3, 
we find that Xg = 0.04 at T = 299 K. A calculation analogous to that for SI, shows 
that A^ = 0.04 is precisely what one would expect in the absence of correlations. 
Hence, we find that secondary-structure correlations are very weak for S3. 

Finally, it is also instructive to identify the backbone H bonds that are most likely 
to occur. We consider an H bond formed if its energy is < ~e^^/3. For SI, we find 
that the bonds NH(Asp6)-CO(Asn3) and NH(Asn7)-CO(Asn3) occur in ^ 38% and 
~ 34 % of the conformations, respectively, at T = 299 K, whereas no other backbone 
H bond has a frequency of occurrence above 15 %. These results confirm that the 
a-helix seen in low-energy conformations for SI is most stable in its N-terminal part. 
Note also that in our simulations this helix often starts with a fork-like H bonding; 
the C0(Asn3) group acts as an acceptor for two bonds. For S3, there is only one 
backbone H bond that occurs in more than 15 % of the conformations at T = 299 K, 
namely NH(Asnll)-C0(Ala8) with a frequency of occurrence of ?a 21 %. The paucity 
of H bonds underscores the notion that this peptide is highly fiexible. 

Two Other Sequences Why do we find a different behavior for S3? A major 
reason is the different position of the proline; the proline residue with its special 
geometry is at position 9 in the sequence S3, but at position 4 in SI, S2 and S4. To 
gauge the importance of the proline location, we repeated the same calculations for 
a variant of S3, S3', with Asp4 and Pro9 interchanged. We find that the behavior of 
S3' closely resembles that of SI, S2 and S4. As an example, we show in Fig. 5 the 
a- helix and /3-strand probability profiles for S3'. The S3' profiles are nearly identical 
to those for SI, S2 and S4. In the reshufSing of SI to get S3, the change of profine 
position thus seems particularly important. 

We also studied the sequence obtained by interchanging Pro4 and Thr9 in SI, 
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which we call SI'. We find that this transposition of SI leads to a behavior similar to 
that of S3, as is illustrated by Fig. 5, which confirms the importance of the position 
of the proline. 

Neither SI' nor S3' has, to our knowledge, been studied experimentally. 

Conclusions 

We have investigated the solution behavior of four synthetic peptides, S1-S4, that 
experimentally have been shown to exhibit specific adhesion properties to (100) GaAs 
and Si semiconductor substrates. Wc find that SI, the double mutant S2 and the 
triple mutant S4 all show a very similar behavior with respect to structure as well as 
thermodynamics. At room temperature, these peptides are largely unstructured, but 
have a small but significant a-helix content. For S3, which is a random permutation 
of SI, we find a different behavior. S3 is more fiexible than the other three peptides, 
with a very small content of both /3-strand and ct-helix structure. The lowest-energy 
structure we find for S3 is not a-helical but a /3-hairpin. 

In the experiments, S1-S4 all showed good adhesion to GaAs, especially SI. The 
main difference between the peptides was that S3, in contrast to the other three, 
adhered well to Si, too. 

Interestingly, our results suggest a clear difference in solution behavior between S3 
and the other three peptides. To what extent conformational differences can explain 
the different adhesion properties of these peptides remains to be seen. A possible 
test of this would be to determine the adhesion properties of the sequence S3', which 
in our model shows a solution behavior similar to that of SI, S2 and S4. It would be 
very interesting to see whether the adhesion properties of S3' resemble those of SI, 
S2 and S4, with similar conformational properties as S3', or whether they resemble 
those of S3, with 83 % sequence identity to S3'. 

Acknowledgments This work was in part supported by the German Science Foun- 
dation (no. JA 483/24-1) and the Swedish Research Council. Support from the 
DAAD-STINT Personnel Exchange Programme and the John von Neumann Insti- 
tute for Computing (NIC), Forschungszentrum Jiilich (no. hlzll) is gratefully ac- 



12 



knowledged. 

References and Notes 

(1) Sarikaya, M.; Tamerler, C; Jen, A. K.-Y.; Schulten, K.; Baneyx, F. Nature 
Mat. 2003, 2, 577-585. 

(2) Gray, J. J. Curr. Opin. Struct. Biol. 2004, 14, 110-115. 

(3) Brown, S. Nature Biotechnol. 1997, 15, 269-272. 

(4) Sano, K.-L; Shiba, K. J. Am. Chem Soc. 2003, 125, 14234-14235. 

(5) Whaley, S. R.; English, D. S.; Hu, E. L.; Barbara, P. F.; Belcher, A. M. Nature 
2000, 405, 665-668. 

(6) Wang, S.; Humphreys E. S.; Chung S.-Y.; Delduco D. F.; Lustig, S. R.; Wang, 
H.; Parker, K. N.; Rizzo, N. W.; Subramoney, S.; Chiang, Y.-M.; Jagota, A. 
Nature Mat. 2003, 2, 196-200. 

(7) Goede, K.; Busch, P.; Grundmann, M. Nano Lett. 2004, 4, 2115-2120. 

(8) Goede, K.; Grundmann, M.; Holland-Nell, K.; Beck-Sickinger, A. G. Langmuir 
2006, 22, 8104-8108. 

(9) Dyson, H. J.; Wright, P. E. Curr. Opin. Struct. Biol. 2002, 12, 54-60. 

(10) Shoemaker, B. A.; Portman, J. J.; Wolynes, P. G. PNAS 2000, 97, 8868-8873. 

(11) Gupta, N.; Irback, A. J. Chem. Phys. 2004, 120, 3983-3989. 

(12) Willett, R. L.; Baldwin, K. W.; West, K. W.; Pfeiffer, L. N. PNAS 2005, 102, 
7817-7822. 

(13) Peelle, B. R.; Krauland, E. M.; Wittrup, K. D.; Belcher, A. M. Langmuir 
2005, 21, 6929-6933. 

(14) Irback, A.; Samuelsson, B.; Sjunnesson, F.; WaUin, S. Biophys. J. 2003, 85, 
1466-1473. 

13 



(15) Irback, A.; Mohanty, S. Biophys. J. 2005, 88, 1560-1569. 

(16) Gokoglu, G.; Bachmann, M.; Qelik, T.; Janke, W. Phys. Rev. E 2006, 74, 
041802. 

(17) Nemethy, G.; Gibson, K. D.; Palmer K. A.; Yoon, C. N.; Paterlini, G.; Zagari, 
A.; Rumsey S.; Scheraga, H. A. J. Phys. Chem. 1992, 96, 6472-6484. 

(18) Cormack, A. N.; Lewis, R. J.; Goldstein, A. H. J. Phys. Chem. B 2004, 108, 
20408-20418. 

(19) Liang, T.; Walsh, T. R. Phys. Chem. Chem. Phys. 2006, 8, 4410-4419. 

(20) Braun, R.; Sarikaya, M.; Schulten, K. J. Biomater. Sci., Polym. Ed. 2002, 13, 
747-757. 

(21) Vrbova, T.; Whittington, S. G. J. Phys. A 1996, 29, 6253-6264. 

(22) Bachmann, M.; Janke, W. Phys. Rev. Lett. 2005, 95, 058102. 

(23) Bachmann, M.; Janke, W. Phys. Rev. E 2006, 73, 041802. 

(24) Bachmann, M.; Janke, W. Phys. Rev. E 2006, 73, 020901(R). 

(25) Muthukumar, M. J. Chem. Phys. 1995, 103, 4723-4731. 

(26) Bratko, D.; Chakraborty, A. K.; Shakhnovich, E. I. Chem. Phys. Lett. 1997, 
280, 46-52. 

(27) Golumbfskie, A. J.; Pande, V. S.; Chakraborty A. K. PNAS 1999, 96, 11707- 
11712. 

(28) Bogner, T.; Degenhard, A.; Schmid, F. Phys. Rev. Lett. 2004, 93, 268108. 

(29) Kriksin, Y. A.; Khalatur, P. G.; Khokhlov, A. R. J. Chem . Phys. 2005, 122, 
114703. 

(30) Moghaddam, M. S.; Chan, H. S. J. Chem . Phys. 2006, 125, 164909. 



14 



(31) Zhdanov, V. P.; Kasemo, B. Proteins 1998, 30, 168-176. 

(32) Castells, V.; Yang, S.; Van Tassel, P. R. Phys. Rev. E 2002, 65, 031912. 

(33) Marinari, E.; Parisi, G. Europhys. Lett. 1992, 19, 451-458. 

(34) Lyubartsev, A. P.; Martsinovski, A. A.; Shevkunov, S. V.; Vorontsov- 
Velyaminov, P. N. J. Chem. Phys. 1992, 96, 1776-1783. 

(35) Favrin, G.; Irback, A.; Sjunnesson, F. J. Chem. Phys. 2001, 114, 8154-8158. 

(36) Irback, A.; Mohanty, S. J. Comput. Chem. 2006, 27, 1548-1555. The PROFASI 
package is freely available at |http : / / cbbp . thep . lu . se /activities/ prof asi [ 

(37) Ferrenberg, A. M.; Swendsen, R. H. Phys. Rev. Lett. 1989, 63, 1195-1198. 

(38) Miller, R. G. Biometnka 1974, 61, 1-15. 

(39) Kirkpatrick, S.; Gelatt Jr., C. D.; Vecchi, M. P. Science 1983, 220, 1297-1301. 

(40) DeLano, W. L. The PyMOL Molecular Graphics System (2002) on World Wide 



Web http: //www.pymol.org 



15 



Tables 



TABLE 1: The four sequences studied, and their PAC values for adsorption to 
(100) GaAs and Si surfaces (from Goede et al.®). S2 is a double His^Ala mutant of 
SI, S3 a random permutation of SI, and S4 a triple Asn— >Ala mutant of SI. 
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TABLE 2: The fraction Aq (A^) of conformations that have at least one continuous 
a-helix (/3-strand) segment of length 3 or more. 







SI 






S3 




r(K) 


275 


299 


369 


275 


299 


369 


Aq, 


0.21 


0.12 


0.03 


0.06 


0.04 


0.01 


A/. 


0.03 


0.03 


0.04 


0.04 


0.04 


0.03 
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Figure Captions 



Fig. 1. Temperature dependence of (a) the specific lieat Cy = d{E)/dT and (b) 
d{Rg)/dT, for the sequences S1-S4. Rg is the radius of gyration (calculated over all 
non-H atoms). 

Fig. 2. Temperature dependence of (a) the a-helix content (ua) and (b) the /3- 

strand content (n^), for the sequences S1-S4. We define a residue as a-helical if 
its Ramachandran angles (f) and ip satisfy (f) G (—90°, —30°) and ip G (—77°,— 17°), 
and ria denotes the fraction of the 10 inner residues that are a-helical. Similarly, 
n/3 is the fraction of the 10 inner residues with Ramachandran angles satisfying G 
(-150°, -90°) and ^ G (90°, 150°). 

Fig. 3. Typical low-energy conformations for (a) SI, (b) S2, (c) S3, and (d) S4. 
These structures were obtained as the lowest-energy structures in ten simulated an- 
nealing runs for each sequence, starting from random conformations. In each run, the 
temperature was decreased geometrically from 369 K to 0.7 K in 100 steps. At each 
temperature 100000 elementary MC steps were performed. Drawn with PyMOL.^° 

Fig. 4. Free energies -F(A, E) calculated as functions of RMSD, A, and energy, E, 
for SI and S3 at T = 299 K. The reference structure is either an a-helix or a /3-hairpin 
(see text). The contours are spaced at intervals of 1 ksT. Contours more than 6 ksT 
above the minimum free energy are not shown. The free energy F(A, E) is defined by 
P(A, E) oc exp(-F(A, E)/kBT), where P(A, E) is the joint probability distribution 
of A and E at temperature T. (a) RMSD from the a-helix for SI (calculated over 
residues 5-12). (b) RMSD from the a-helix for S3 (residues 1-8). (c) RMSD from 
the /3-hairpin for S3 (all residues). Note that the x scale is different in (c). 

Fig. 5. Secondary-structure profiles for S1-S4, SI' (AQNTSDNNPHTH) and S3' 
(TNHPHSNADTNQ) at T = 299 K. (a) The probability that residue i is in the a- 
helix state, {Xa{i)), against i. (b) The probability that residue i is in the /3-strand 
state, (x/3(^)), against i. The lines are only guides to the eye. 
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